Holistic Shape-Based Object Recognition Using Bottom-Up Image Structures
نویسندگان
چکیده
HOLISTIC SHAPE-BASED OBJECT RECOGNITION USING BOTTOM-UP IMAGE STRUCTURES Praveen Srinivasan Jianbo Shi, Associate Professor of Computer and Information Science Object recognition performance that rivals human ability is one of the primary goals of computer vision research. While recognition may take many forms, key tasks include detection, estimating object pose, and segmenting the object from the background. This thesis explores the use of holistic shape matching for recognition using bottom-up image structures such as image segments and contours for all of these tasks. Holistic shape matching utilizes global information about object shape for matching, rather than local image features which often contain too little information to match reliably to the object model. By examining different tasks related to object recognition, we demonstrate the value of holistic shape matching in a broad range of problems, including perceptual grouping, human pose estimation, and object recognition. First, we introduce a method for perceptual grouping of contours in an image into larger groups that uses holistic shape matching to estimate the motion of image contours to a second, related image and group them according to similarties in motion. Holistic shape matching provides scoring for different motion hypotheses, and the final grouping is achieved using a min-cut graph cut to infer the cluster assignment for each contour. Secondly, we describe a method for human pose estimation using image segments that incrementally merges segments into hypotheses for increasingly larger regions of the human body. These hypotheses are verified by matching against a set of shape exemplars using a shape matching method that is articulation-invariant and incorporates holistic shape information. Lastly, we present a two-step method for automatically learning an object detector
منابع مشابه
Shape Detection by Packing Contours
SHAPE DETECTION BY PACKING CONTOURS Qihui Zhu Jianbo Shi Humans have an amazing ability to localize and recognize object shapes from natural images with various complexities, such as low contrast, overwhelming background clutter, large shape deformation and signicant occlusion. We typically recognize object shape as a whole the entire geometric conguration of image tokens and the context they a...
متن کاملShape Detection by Packing Contours and Regions
Humans have an amazing ability to localize and recognize object shapes from natural images with various complexities, such as low contrast, overwhelming background clutter, large shape deformation and significant occlusion. We typically recognize object shape as a whole the entire geometric configuration of image tokens and the context they are in. Detecting shape as a global pattern involves t...
متن کاملA Gestaltist approach to contour-based object recognition: Combining bottom-up and top-down cues
This paper proposes a method for detecting generic classes of objects from their representative contours that can be used by a robot with vision to find objects in cluttered environments. The approach uses a mid-level image operator to group edges into contours which likely correspond to object boundaries. This mid-level operator is used in two ways, bottom-up on simple edges and top-down incor...
متن کاملShape Representations for Object Recognition
The problem of object recognition has been at the forefront of computer vision research in the last decade. The most successful approaches have used mainly edgeor texture-based representations. The shape of the object outline, albeit widely used for pre-segmented objects, has found limited applicability to the detection problem in real images. The fact that shape is a truly holistic global perc...
متن کاملA Mid-Level Approach to Contour-based Categorical Object Recognition
This paper proposes a method for detecting generic classes of objects from their representative contours that can be used by a robot with vision to find objects in cluttered environments. The approach uses a mid-level image operator to group edges into contours which likely correspond to object boundaries. This mid-level operator is used in two ways, bottom-up on simple edges and top-down incor...
متن کامل